Mining Projects from Structured and Unstructured Data

نویسنده

  • Saimir Bala
چکیده

Companies working on safety-critical projects must adhere to strict rules imposed by the domain, especially when human safety is involved. These projects need to be compliant to standard norms and regulations. Thus, all the process steps must be clearly documented in order to be verifiable for compliance in a later stage by an auditor. Nevertheless, documentation often comes in the form of manually written textual documents in different formats. Moreover, the project members use diverse proprietary tools. This makes it difficult for auditors to understand how the actual project was conducted. My research addresses the project mining problem by exploiting logs from project-generated artifacts, which come from software repositories used by the project team.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Recent Survey on Unstructured Data to Structured Data in Distributed Data Mining

The organization of unstructured data is recognized as one of the major uncertain problems in the information industry and data mining paradigm. It will be in the form of computerized information that moreover, does not have a data model and there are not simply used by data mining. The task of managing unstructured data signifies possibly the major data management opportunity for our community...

متن کامل

Text Mining and Site Outlining Projects

2 Knowledge discovery from a large amount of unstructured or semi-structured text (KDT) has been quickly forming a major research trend. In particular, it has become extremely important for customer relationship management (CRM) and business intelligence (BI) applications since KDT will be able to go beyond conventional demographic and stochastic analysis of databases, and focus on textual info...

متن کامل

Comparison of Structured vs. Unstructured Data for Industrial Quality Analysis

Industrial methods for quality analysis massively rely on structured data describing product features and product usage. The analysis of such data is normally done using complex reporting or sophisticated data mining methods. Besides this structured data, companies very often also posses large amounts of unstructured text like call center reports, internet fora or repair order documents. Despit...

متن کامل

Knowledge Networks of Biological and Medical Data: An Exhaustive and Flexible Solution to Model Life Science Domains

The huge amount of unstructured information generated by academic and industrial research groups must be easily available to facilitate scientific projects. In particular, information that is conveyed by unstructured or semistructured text represents a vast resource for the scientific community. Systems capable of mining these textual data sets are the only option to unveil the information hidd...

متن کامل

In-depth Interactive Visual Exploration for Bridging Unstructured and Structured Document Content

Semi-structured data refers to the combination of unstructured and structured data. Unstructured data is free text in natural language, while structured data is typically stored in tables and following a data schema. Recent statistics shows that 80% of the data generated in the last two years is unstructured. However, one interesting observation is that free text usually comes along with some s...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017